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Description 

[0001] The invention relates to parametric audio coding. 

[0002] Heiko Purnhagen, 'Advances in parametric audio coding', Proc. 1999 IEEE Workshop on Applications of Signal 
5 Processing to Audio and Acoustics, New Paltz, New York, Oct. 1 7-20, 1 999 discloses that parametric modeling provides 
an efficient representation of general audio signals and is utilized In very low bit rate audio coding. It is based on the 
decomposition of an audio signal into components which are described by appropriate source models and represented 
by model parameters (like the frequency and amplitude of a pure tone). Perception models are utilized in signal decom- 
position and model parameter coding. 
10 [0003] An object of the invention is to provide an advantageous parameterization of a multi-channel (e.g. stereo) audio 
signal. To this end, the invention provides a method of encoding, an encoder an encoded audio signal, a storage medium, 
a method of decoding and a decoder as defined in the independent claims. Advantageous embodiments are defined in 
the dependent claims. 

[0004] It is noted that stereo audio coding as such is known in the prior art. For example, the two channels left (L) and 
15 right (R) may be coded independently. This may be done by two independent encoders arranged in parallel or by time 
multiplexing in one encoder. Usually, one can code the two channels more efficiently by using cross-channel correlation 
(and irreievancies) in the signal. Reference is made to the IVIPEG-2 audio standard (ISO/IEC 13818-3, pages 5,6) which 
discloses joint stereo coding. Joint stereo coding exploits the redundancy between left and right channels in order to 
reduce the audio bit-rate. Two forms of joint stereo coding are possible: MS stereo and intensity stereo. MS stereo is 
20 based on coding the sum (L-i-R) and the difference (L-R) signal instead of the left (L) and right (R) channels. Intensity 
coding is based on retaining at high frequencies only the energy envelope of the right (R) and left (L) channels. Direct 
application of the MS stereo coding principle in parametric coding instead of in subband coding would result in a param- 
eterized sum signal and a parameterized difference signal. The forming of the sum signal and the difference signal 
before encoding might give rise to the generation of additional frequency components in the audio signal to be encoded 
25 which reduces the efficiency of the parametric coding. Direct application of the intensity stereo coding principle on a 
parametriccoding scheme would result in a lowfrequency part with independently encodedchannels and a high frequency 
part that includes only the energy envelope of the right and left channels. 

[0005] According to a first aspect of the invention, common frequencies are determined in the at least two channels 
of the audio signal, which common frequencies occur in at least two of the at least two channels, and respective sinusoidal 

30 components in respective channels at a given common frequency are represented by a representation of the given 
common frequency, and a representation of respective amplitudes of the respective sinusoidal components at the given 
common frequency. This aspect is based on the insight that a given frequency generated by a given source has a high 
probability to have a component in each of the channels. These signal components will have their frequency in common. 
This is true because signal transformations that may occur in the transmission from sound source via recording equipment 

35 to the listener will usually not affect frequency components differentially in the various or all channels. Thus, common 
components in the various signal channels can be represented by a single, common frequency. The respective amplitudes 
(and phases) of the respective components in the respective channels may differ. Thus, by coding the sinusoids with a 
common frequency and a representation of the respective amplitudes, an efficient compressive coding of the audio 
signal is achieved; only one parameter is needed to encode a given common frequency (which occurs in various channels). 

40 Further, such a parameterization is advantageously applied with a suitable psycho-acoustic model. 

[0006] Once a common frequency has been found, the other parameters describing the components in each respective 
channel can be represented. For example, for a stereo signal that is represented with sinusoidal components, the mean 
and the difference of the amplitudes (and optionally the respective phases) can be coded. In a further embodiment, the 
largest amplitude is encoded in the coded audio stream together with a difference amplitude, wherein the sign of the 

45 difference amplitude may determine the dominant channel for this frequency. 

[0007] Since there is likely to be some degree of correlation between the left and the right channels, entropy coding 
of the sinusoidal parameters can be used which will result in more efficient encoding of the stereo signal. In addition, 
irrelevant information within the common component representation can be removed, e.g. interaural phase differences 
at high frequencies are inaudible and can be set to zero. 

50 [0008] It is possible to encode any frequency occurring in the channels as acommon frequency. If afrequency occurring 
in one channel does not occur in another channel, the amplitude representation should then be encoded such as to 
result in a zero amplitude for the channel in which the frequency does not occur. For example if in a multi-channel 
application afrequency occurs in 3 of the 4 channels, then the frequency can be encoded as acommon frequency while 
making the amplitude zero in the channel in which the frequency does not occur. 

55 [0009] Non-common frequencies may also be represented as independent sinusoids in the respective channels. 
Non-common frequencies can be encoded in a separate parameter block. It is further possible to produce afirst parameter 
block including common frequencies which common frequencies are common to all channels, a second parameter block 
which includes frequencies which are common to a (predetermined) subset of all channels, a third parameter block 
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which includes frequencies which are connmon to a further (predetenTiined) subset of all channels, and so on until a last 
parameter block which includes the frequencies which occur in only one channel and which are independently coded. 
[0010] A common frequency may be represented as an absolute frequency value but also as a frequency changing 
overtime, e.g. a first derivative 8f/3t. Further, the common frequencies may be differentially encoded relative to other 
5 common frequencies. 

[0011] Common frequencies can be found by estimating frequencies by considering two or more channels at the same 
time. 

[0012] In a first embodiment, frequencies are separately determined for the respective channels followed by a com- 
parison step to determine the common frequencies. The determination of the frequencies occurring in the respective 
^0 channels may be performed by a conventional matching pursuit (see e.g. S.G. Mallat and Z. Zhang, "Matching pursuits 
with time-frequency dictionaries," IEEE trans, on Signal Processing, vol. 41, no. 12, pp. 3397-3415) or peak picking 
(see e.g. R. McAulay and T. Quatieri, "Speech Analysis/Synthesis Based on a Sinusoidal Representation," IEEE Trans. 
ASSP, Vol. 34, No. 4, pp. 744-754, Aug. 1986). 

[0013] In a second embodiment for determining the common frequencies, a combined matching pursuit algorithm is 
15 employed. For example, respective power or energy representations of the at least two channels are combined to obtain 
a common representation. The common frequencies are then determined based on the common representation. Pref- 
erably, the power spectra of the at least two channels are added to obtain a common power spectrum. A conventional 
matching pursuit is used to determine the frequencies in this added spectrum. The frequencies found in this added power 
spectrum are determined to be common frequencies. 
20 [001 4] In a third embodiment for detemnining the common frequencies, peak picking in added power spectra is used. 
The frequencies of the maxima that are found in this common power spectrum can be used as the common frequencies. 
One could also add log-power spectra instead of linear power spectra. 

[0015] Preferably, the phase of the respective components of the common frequency is also encoded. A common 
phase, which may be the average phase of the phases in the channels or the phase of the channel with the largest 

25 amplitude, and a difference phase (interchannel) may be included in the coded audio signal. Advantageously, the 
difference phase is only encoded up to a given threshold frequency (e.g. 1 .5 kHz or 2 kHz). For frequencies higher than 
this threshold, no difference phase is encoded. This is possible without reducing the quality significantly, because human 
sensitivity to interaural phase differences is low for frequencies above this threshold. Therefore, a difference phase 
parameter is not necessary for frequencies above the given threshold. Upon decoding, the delta phase parameter can 

30 be assumed to be zero for frequencies above the threshold. The decoder is arranged to receive such signals. Above 
the threshold frequency the decoder does not expect any codes for difference phases. Because the difference phases 
are in practical embodiment not provided with an identifier, it is important for the decoder to know when to expect 
difference phases and when not. Further, because the human ear is less sensitive to large interaural intensity differences, 
delta amplitudes which are larger than a certain threshold, e.g. 10 dB, can be assumed infinite. Consequently, also in 

35 this case no interaural phase differences need to be encoded. 

[0016] Frequencies in different channels differing less than a given threshold may be represented by a common 
frequency. In this case it is assumed that the differing frequencies originate from the same source frequency. In practical 
embodiments, the threshold is related to the accuracy of the matching pursuit or peak-picking algorithm. 
[0017] In practical embodiments, the parameterization according to the invention is employed on frame-basis. 

40 [0018] The invention is applicable to any audio signal, including speech signals. 

[001 9] These and other aspects of the invention will be apparentfrom the elucidated with reference to the accompanying 
drawings. 

[0020] In the drawings: 



45 Fig. 1 shows an encoder according to an embodiment of the invention; 

Fig. 2 shows a possible implementation of the encoder of Fig. 1 ; 
Fig. 3 shows an alternative implementation of the encoder of Fig. 1 , and 
Fig. 4 shows a system according to an embodiment of the invention. 



50 [0021] The drawings only show those elements that are necessary to understand the embodiments of the invention. 
[0022] Fig. 1 shows an encoder 1 1 according to an embodiment of the invention. A multi-channel audio signal is input 
to the encoder. In this embodiment the multi-channel audio signal is a stereo audio signal having a left channel L and 
a right channel R. The encoder 1 1 has two inputs: one input for the left channel signal L and another input for the right 
channel signal R. Alternatively, the encoder has one input for both channels L and R which are in that case furnished 

55 in a multiplexed form to the encoder 1 1 . The encoder 1 1 extracts sinusoids from both channels and determines common 
frequencies fcom- The result of the encoding process performed in the encoder 11 is an encoded audio signal. The 
encoded audio signal includes the common frequencies f^orn P^i* common frequency l^om ^ representation of the 
respective amplitudes in the respective channels, e.g. in the form of a maximum or average amplitude A and a difference 
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(delta) amplitude AA. 

[0023] In the following, it is described how the comnnon frequencies nnay be determined, a first embodiment employing 
a matching pursuit and second embodiment employing peak-picking. 

An embodiment employing 'matching pursuit' 

[0024] This method is an extension of existing matching pursuit algorithms. Matching pursuits are well-known in the 
art. A matching pursuit is an iterative algorithm. It projects the signal onto a matching dictionary element chosen from a 
redundant dictionary of time-frequency waveforms. The projection is subtracted from the signal to be approximated in 
the next iteration. Thus in existing matching pursuit algorithms, the parameterization is peri'ormed by iteratively deter- 
mining a peak of the 'projected' power spectrum of a frame of the audio signal, deriving the optimal amplitude and phase 
corresponding to the peak frequency, and extracting the corresponding sinusoid from the frame under analysis. This 
process is iteratively repeated until a satisfactory parameterization of the audio signal is obtained. To derive common 
frequencies in a multi-channel audio signal, the power spectra of the left and right channels are added and the peaks 
of this sum power spectrum are determined. These peak frequencies are used to determine the optimal amplitudes and 
optionally the phases of the left and the right (or more) channels. 

[0025] The multi-channel matching pursuit algorithm according to a practical embodiment of the invention comprises 
the step of splitting the multi-channel signal into short-duration (e.g. 10 ms) overlapping frames, and applying iteratively 
the following steps on each of the frames until a stop criterion has been met: 

1 . The power spectra of each of the channels of the multi-channel frame are calculated 

2. The power spectra are added to obtain a common power spectrum 

3. The frequency at which the common 'projected' power spectrum is maximum is determined 

4. For the frequency determined in step 3, for each channel, the amplitude and phase of the best matching sinusoid 
are determined and all these parameters are stored. These parameters are encoded using the common frequencies 
in combination with a representation of the respective amplitudes thereby exploiting cross-channel correlations and 
irrelevancies. 

5. The sinusoids are subtracted from the corresponding cun-ent multi-channel frames to obtain an updated residual 
signal which serves as the next multi-channel frame in step 1 . 

Embodiment using 'peak picking' 

[0026] Alternatively, peak picking may be used, e.g. including the following steps: 

1 . The power spectra of each of the channels of the multi-channel frame are calculated 

2. The power spectra are added to obtain a common power spectrum 

3. The frequencies corresponding to all peaks within the power spectrum are determined 

4. For these determined frequencies, the best amplitudes and best phases are obtained 

[0027] Fig. 2 shows a possible implementation of the encoder of Fig. 1 , which makes use of a common (added) power 
spectrum of the channels to determine the common frequencies. In calculation unit 110 a matching pursuit process or 
a peak picking process is performed as described above by using a common power spectrum obtained from the L and 
R channels. The detemnined common frequencies f^om are furnished to coding unit 111. This coding unit determines 
the respective amplitudes of the sinusoids (and preferably the phases) in the various channels at a given common 
frequency. 

[0028] Alternatively, the respective channels are independently encoded to obtain a set of parameterized sinusoids 
for each channel. These parameters are thereafter checked for common frequencies. Such an embodiment is shown 
in Fig. 3. Fig. 3 shows an alternative implementation of the encoder 1 1 of Fig. 1 . In this implementation, the encoder 1 1 
comprises two independent parametric encoders 112 and 113. The parameters f^_, A|_ and f^, Ar obtained in these 
independent coders are furnished to a further coding unit 1 1 4 which determines the common frequencies f^o^ in these 
two parameterized signals. 

Example of coding a stereo audio signal 



[0029] Assume that a stereo audio signal is given with the following characteristics: 
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channel 


f(Hz) 


A(dB) 


f(Hz) 


A(dB) 


f(Hz) 


A(dB) 


f(Hz) 


A(dB) 


f(Hz) 


A(dB) 


L 


50 


30 


100 


50 


250 


40 






500 


40 


R 


50 


20 


100 


60 






200 


30 


500 


35 



In practice, in tlie case tine amplitude difference between the channels is +15 dB or -15dB at a given frequency, this 
frequency is considered to occur only in the dominant channel. 

Independeretly coded 

[0030] The following parameterization can be used to code the exemplary stereo signal independently. 

L(f,A) = (50,30), (100,50), (250,40), (500,40) 
R(f,A) = (50,20), (100,60), (200,30), (500,35) 
^5 This parameterization requires 1 6 parameters. 

Using common frequencies and non-common frequencies 

[0031] Common frequencies are 50 Hz, 1 00 Hz and 500 Hz. To code this signal: 
20 (Fcom.Amax- AA) = (50,30,1 0), (1 00,60,-1 0), (500,40,5) 
(Fnon-comA) = (200,-30), (250,40) 

Coding the exemplary stereo audio signal using common and non-common frequencies requires 13 parameters in this 
example. Compared to the independently coded multi-channel signal, the use of common frequencies reduces the 
number of coding parameters. Further, the values for the delta amplitude are smaller than for the absolute amplitudes 

25 as given in the independently coded multi-channel signal. This further reduces the bit-rate. 

[0032] The sign in the delta amplitude AA determines the dominant channel (between two signals). In the above 
example, a positive amplitude means that the left channel Is dominant. The sign can also be used in the non-common 
frequency representation to indicate for which signal the frequency is valid. Same convention is used here: positive is 
left (dominant). It is alternatively possible to give an average amplitude in combination with a difference amplitude, or 

jQ consistently the amplitude of a given channel with a difference amplitude relative to the other channel. 

[0033] Instead of using the sign in the delta amplitude AA to detennine the dominant channel, it is also possible to 
use a bit in the bit-stream to indicate the dominant channel. This requires 1 bit as may also be the case for the sign bit. 
This bit is included in the bit-stream and used in the decoder. In the case that an audio signal is encoded with more than 
two channels, more than 1 bit is needed to indicate the dominant channel. This implementation is straightforward. 

35 

Use of only common frequencies 

[0034] When only a representation based on common frequencies is used, the non-common frequencies are coded 
such that the amplitude of the common frequency in the channel in which no sinusoid occurs at that frequency is zero. 
4Q In practice, a value of e.g. -f15 dB or -1 5 dB for the delta amplitude can be used to indicate that no sinusoid of the current 
frequency is present in the given channel. The sign In the delta amplitude AA determines the dominant channel (between 
two signals). In this example, a positive amplitude means that the left channel is dominant. 
(fcowA' ^) = (50,30,10), (100,60,-10), (200,30,-15), (250,40,15), (500,40,5) 

This parameterization requires 15 parameters. For this example, the use of only common frequencies is less advanta- 
45 geous than the use of common and non-common frequencies. 

Frequency average and differences 

[0035] (Fav, AF,Aav, AA) = (50,0,25,5), (1 00,0,55,-5), (225,25,35,5), (500,0,30,1 0) 
5Q This parameterization requires 1 6 parameters. 

This is an alternative encoding wherein the sinusoidal components in the signal are represented by average frequencies 

and average amplitudes. It is clear that also compared with this coding strategy, the use of common frequencies is 

advantageous. It is noted that the use of average frequencies and average amplitudes can be seen as a separate 

invention outside the scope of the current application. 
55 [0036] It is noted that not strictly the number of parameters but rather the sum of the number of bits per parameter is 

of importance for the bit-rate of the resulting coded audio stream. In this respect, differential coding usually provides a 

bit-rate reduction for correlated signal components. 

[0037] The representation with a common frequency parameter and respective amplitudes (and optionally respective 



5 



EP 1 479 071 B1 



phases) can be regarded as a mono representation, captured in the parameters common frequency, average or maximum 
amplitude, phase of the average or maximum amplitude (optional) and a multi-channel extension captured in the pa- 
rameters delta amplitude and delta phase (optional). The mono parameters can be treated as standard parameters that 
one would get in a mono sinusoidal encoder. Thus, these mono parameters can be used to create links between sinusoids 

5 in subsequent frames, to encode parameters differentially according to these links and to perform phase continuation. 
The additional, multi-channel parameters can be encoded according to strategies mentioned above which further exploit 
binaural hearing properties. The delta parameters (delta amplitude and delta phase) can also be encoded differentially 
based on the links that have been made based on the mono parameters. Further, to provide a scalable bit-stream, the 
mono parameters may be included in a base layer, whereas the multi-channel parameters are included in an enhancement 

^0 layer. 

[0038] In the tracking of the mono components, the cost function (or similarity measure) is a combination of the cost 
for the frequency, the cost for the amplitude and (optionally) the cost for the phase. For stereo components, the cost 
function may be a combination of the cost for the common frequency, the cost for the average or maximum amplitude, 
the cost for the phase, the cost for the delta amplitude and the cost for the delta phase. Alternatively, one may use for 
15 the cost function for stereo components: the common frequency, the respective amplitudes and the respective phases. 
[0039] Advantageously, the sinusoid parameterization using a common frequency and a representation of the respec- 
tive amplitudes of that frequency in the respective channels is combined with a mono transient parameterization such 
as disclosed in WO 01/69593-A1. This may further be combined with a mono representation for the noise such as 
described in WO 01/88904. 

20 [0040] Although most of the embodiments described above relate to two-channel audio signals, the extension to three 

or more channel audio signals is straightforward. 

[0041] Addition of an extra channel to an already encoded audio signal can advantageously be done as follows: it 
suffices to identify in the encoded audio signal that an additional channel is present and to add to the encoded audio 
signal a representation of the amplitudes of the common frequencies present in the extra channel and a representation 
25 of the non-common frequencies. Phase information can optionally be included in the encoded audio signal either. 

[0042] In a practical embodiment, the average or maximum amplitude and the average phase of the largest amplitude 
at a common frequency are quantized similar to the respective quantization of the delta amplitude and the delta phase 
at the common frequency for the other channel(s). Practical values for the quantization are: 

30 common frequency resolution of 0.5 % 

amplitude, delta amplitude resolution of 1 dB 
phase, delta phase resolution of 0.25 rad 

35 The proposed multi-channel audio encoding provides a reduction of the bit rate when compared to encoding the channels 
independently. 

[0043] Fig. 4 shows a system according to an embodiment of the invention. The system comprises an apparatus 1 
for transmitting or storing an encoded audio signal [S]. The apparatus 1 comprises an input unit 10 for receiving an at 
least two-channel audio signal S. The input unit 10 may be an antenna, microphone, network connection, etc. The 

4Q apparatus 1 further comprises the encoder 1 1 as shown in Fig. 1 for encoding the audio signal S to obtain an encoded 
audio signal with a parameterization according to the current invention, e.g. (fcom' ^av i^com^ ^n\ax' ^^)- '^^^ 

encoded audio signal parameterization is furnished to an output unit 12 which transforms the encoded audio signal in 
a suitable format [S] for transmission or storage via a transmission medium or storage medium 2. The system further 
comprises a receiver or reproduction apparatus 3 which receives the encoded audio signal [S] in an input unit 30. The 

45 input unit 30 extracts from the encoded audio signal [S] the parameters (fcom' ^) (^com' ^max' These 
parameters are furnished to a decoder 31 which synthesizes a decoded audio signal based on the received parameters 
by generating the common frequencies having the respective amplitudes in order to obtain the two channels L and R of 
the decoded audio signal S'. The two channels L and R are furnished to an output unit 32 that provides the decoded 
audio signal S'. The output unit 32 may be reproduction unit such as a speaker for reproducing the decoded audio signal 

50 S'. The output unit 32 may also be a transmitter for further transmitting the decoded audio signal S' for example over 
an in-home network, etc. 

[0044] It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that 
those skilled in the art will be able to design many alternative embodiments without departing from the scope of the 
appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the 
55 claim. This word 'comprising' does not exclude the presence of other elements or steps than those listed in a claim. The 
invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably 
programmed computer. In a device claim enumerating several means, several of these means can be embodied by one 
and the same item of hardware. The mere fact that certain measures are recited in mutually different dependent claims 
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does not indicate that a combination of these nneasures cannot be used to advantage. 



Claims 

5 

1. A method of encoding (11) an at least two-channel audio signal (UR), the method comprising: 

determining (110) common frequencies (fcom) '"^ least two channels (L,R) of the audio signal, which 

common frequencies occur in at least two of the at least two channels of the audio signal, and 
^0 representing (111) respective sinusoidal components in respective channels at a given common frequency by 

a representation of the given common frequency (fcom) ^ representation of respective amplitudes (A,AA) 
of the respective sinusoidal components at the given common frequency. 

2. A method of coding as claimed in claim 1 , wherein the representation of the respective amplitudes (A,AA) comprises 
15 an average amplitude (A) and a difference amplitude (AA). 

3. A method of coding as claimed in claim 1 , wherein the representation of the respective amplitudes (A,AA) comprises 
a maximum amplitude (A) and a difference amplitude (AA). 

20 4. A method of coding as claimed in claim 1 , wherein non-common frequencies are coded as common frequencies, 
wherein the amplitude representation includes an indication for indicating the at least one channel in which the 

frequency does not occur. 

5. A method of coding as claimed in claim 1 , wherein in addition to the common frequencies, non-common frequencies 
25 are coded independently. 

6. A method as claimed in claim 5, wherein the non-common frequencies are grouped in the coded audio stream in a 
separate block. 

30 7. A method as claimed in claim 6, wherein the common frequencies are grouped and included in the encoded audio 
signal preceding to the block of non-common frequencies. 

8. A method as claimed in claim 6, wherein the parameters of the sinusoidal components at the common frequencies 
are included in abase layer andthe parameters of the sinusoids at non-common frequencies in an enhancement layer. 

35 

9. A method as claimed in claim 1 , wherein the method comprises the step of combining respective power or energy 
representations of the at least two channels to obtain a common representation and wherein the step of determining 
the common frequencies is performed based on the common representation. 

40 10. A method as claimed in claim 9, wherein the combining step includes adding power spectra of the at least two 
channels and wherein the common representation is a common power spectrum. 

11. A method as claimed in claim 1 , wherein frequency and amplitude parameters are included in a base layer and the 
delta amplitude is included in an enhancement layer. 

45 

12. A method as claimed in claim 1, wherein respective phases of the respective sinusoids at the given common 
frequency are determined and wherein a representation of the respective phases is included in the encoded audio 
signal. 

50 13. A method as claimed in claim 12, wherein the representation of the respective phases includes an average phase 
and a difference phase. 

14. A method as claimed in claim 12, wherein the representation of the respective phases includes a phase of the 
channel with a largest amplitude, and a difference phase. 



55 



15. A method as claimed in claim 12, wherein the representation of the respective phases is only included in the signal 
for sinusoids having a frequency up to a given threshold frequency. 
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16. A method as claimed in claim 15, wherein the given threshold frequency is about 2 kHz. 

17. A method as claimed in claim 1 2, wherein the representation of the respective phases is only included in the signal 
for sinusoids having an amplitude difference with at least one of the other channels up to a given amplitude threshold. 

5 

18. A method as claimed In claim 17, wherein the given amplitude threshold is 10 dB. 

19. An encoder (11) for encoding an at least two-channel audio signal (L,R), the encoder comprising: 

^0 means (1 10) for determining common frequencies (fcom) '"^ the at least two channels (L,R) of the audio signal, 

which common frequencies occur in at least two of the at least two channels of the audio signal, and 
means (111) for representing respective sinusoidal components in respective channels at a given common 
frequency by a representation of the given common frequency (fcom) ^ representation of respective ampli- 
tudes (A,AA) of the respective sinusoidal components at the given common frequency. 

15 

20. An apparatus (1) for transmitting or recording, the apparatus comprising 

an input unit (10) for receiving an at least two-channel (L,R) audio signal (S), 

an encoder (11 ) as claimed in claim 1 9 for encoding the audio signal (S) to obtain an encoded audio signal ([S]), and 
an output unit for providing the encoded audio signal ([S]). 

20 

21. An encoded audio signal ([S]) representing an at least two-channel audio signal (L,R), the encoded audio signal 
comprising: 

representations of common frequencies (fcom)' which common frequencies represent frequencies which occur 
25 in at least two of the at least two channels of the audio signal [S], and 

for a given common frequency (fcom)' ^ representation of respective amplitudes (A,AA) representing respective 
sinusoidal components in respective channels at the given common frequency. 

22. A storage medium (2) having stored thereon a signal as claimed in claim 21 . 

30 

23. A method of decoding (31) an encoded audio signal ([S]), the method comprising: 

receiving (31) the encoded audio signal ([S]) representing an at least two-channel audio signal (L,R), the encoded 
audio signal comprising representations of common frequencies (fcom)» which common frequencies represent 
35 frequencies which occur in at least two of the at least two channels of the audio signal [S], and for a given 

common frequency (fcom)' ^ representation of respective amplitudes (A,AA) representing respective sinusoidal 
components in respective channels at the given common frequency, and 

generating (31) the common frequencies at the respective amplitudes in the at least two channels (L,R) to obtain 
a decoded audio signal (S'). 

40 

24. A decoder (31) for decoding an encoded audio signal ([S]), the decoder comprising: 

means (31) for receiving the encoded audio signal ([S]) representing an at least two-channel audio signal (L,R), 
the encoded audio signal comprising representations of common frequencies (fcom)' which common frequencies 
45 represent frequencies which occur in at least two of the at least two channels of the audio signal [S], and for a 

given common frequency (fcom)' ^ representation of respective amplitudes (A,AA) representing respective si- 
nusoidal components in respective channels at the given common frequency, and 

means (31) for generating the common frequencies at the respective amplitudes in the at least two channels 
(L,R) to obtain a decoded audio signal (S'). 

50 

25. A receiver or reproduction apparatus (3), the apparatus comprising: 

an input unit (30) for receiving an encoded audio signal ([S]), 

a decoder (31) as claimed in claim 24 to decode the encoded audio signal ([S]) to obtain a decoded audio signal 
55 (S'), and 

an output unit (32) to provide the decoded audio signal (S'). 



8 



EP 1 479 071 B1 



Patentanspruche 

1. Verfahren zum Codieren (11) eines wenigstens Zweikanal-Audiosignals (UR), wobei dieses Verfahren die nachfol- 
genden Verfahrensschritte umfasst: 

- das Bestimmen (110) gemeinsamer Frequenzen in den wenigstens zwei KanSlen (L,R) des Audiosignals, 
wobei diese gemeinsamen Frequenzen in wenigstens zwei der wenigstens zwei Kanale des Audiosignals auf- 

treten, und 

- das Darstellen (111) betreffender sinusformiger Anteile in den betreffenden Kanalen bei einer bestimnnten 
Frequenz durcli eine Darstellung der bestimnnten gemeinsamen Frequenz (fcom) und eine Darstellung der 
betreffenden Amplituden (A, AA) der betreffenden sinusformigen Anteile bei der betreffenden gemeinsamen 
Frequenz. 

2. Verfahren zum Codieren nach Ansprucli 1, wobei die Darstellung der betreffenden Amplituden (A,AA) eine mittlere 
Amplitude (A) und eine Drfferenzamplitude (AA) aufweist. 

3. Verfahren zum Codieren nach Anspruch 1 , wobei die Darstellung der betreffenden Amplituden (A, AA) eine maximale 
Amplitude (A) und eine Drfferenzamplitude (AA) aufweist. 

4. Verfahren zum Codieren nach Anspruch 1, wobei nicht gemeinsame Frequenzen als gemeinsame Frequenzen 
codiert werden, wobei die Amplitudendarstellung eine Angabe umfasst zum Angeben des wenigstens einen Kanals, 

in dem die Frequenz nicht auftritt. 

5. Verfahren zum Codieren nach Anspruch 1 , wobei zusatzllch zu den gemeinsamen Frequenzen, nicht gemeinsame 
Frequenzen unabhangig codiert werden. 

6. Verfahren nach Anspruch 5, wobei die nicht gemeinsamen Frequenzen In dem codlerten Audiostrom In einem 
separaten Block gruppiert werden. 

7. Verfahren nach Anspruch 6, wobei die gemeinsamen Frequenzen gruppiert werden und in dem codierten Audiosignal 
vor dem Block nicht gemeinsamer Frequenzen eingesch lessen werden. 

8. Verfahren nach Anspruch 6, wobei die Parameter der sinusformigen Anteile mit den gemeinsamen Frequenzen in 
einer Basisschicht eingeschlossen sind und die Parameter der Sinuskurven mit nicht gemeinsamen Frequenzen in 
einer Anreicherungsschicht eingeschlossen sind. 

9. Verfahren nach Anspruch 1, wobei das Verfahren den nachfolgenden Schritt umfasst: das Komblnieren der Lei- 
stungs- bzw. Energiedarstellung der wenigstens zwei Kanale zum Erhalten einer gemeinsamen Darstellung und 
wobei der Schritt der Bestimmung der gemeinsamen Frequenzen auf Basis der gemeinsamen Darstellung durch- 
geruhrt wird. 

10. Verfahren nach Anspruch 9, wobei der Komblnierschritt das Hinzufugen von Leistungsspektren der wenigstens 
zwei Kanale umfasst und wobei die gemeinsame Darstellung ein gemeinsames Leistungsspektrum ist. 

11. Verfahren nach Anspruch 1 , wobei Frequenz- und Amplltudenparameter in einer Basisschicht vorgesehen sind und 
die Dettaamplitude in einer Anreicherungsschicht vorgesehen ist. 

12. Verfahren nach Anspruch 1, wobei die betreffenden Phasen der betreffenden Sinuskurven bei der gestimmten 
gemeinsamen Frequenz bestimmt werden und wobei eine Darstellung der betreffenden Phasen in dem codierten 

Audiosignal eingeschlossen ist. 

13. Verfahren nach Anspruch 12, wobei die Darstellung der betreffenden Phasen eine mittlere Phase und eine Diffe- 
renzphase umfasst. 

14. Verfahren nach Anspruch 12, wobei die Darstellung der betreffenden Phasen eine Phase des Kanals mit einer 
groBten Amplitude und einer Differenzphase umfasst. 

15. Verfahren nach Anspruch 12, wobei die Darstellung der betreffenden Phasen nur in dem Signal fur Sinuskurven 
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mit einer Frequenz bis zu einer bestimmten Schwellenfrequenz eingesch lessen sind. 

16. Verfahren nach Anspruch 15, wobei die bestimmte Schwellenfrequenz etwa 2 kHz betragt. 

17. Verfahren nach Anspruch 12, wobel die Darstellung der betreffenden Phasen nur In dem Signal fur SInuskurven 
mIt einer Annplitudendlfferenzmltwenlgstens einem deranderen Kanaieblszu einer bestlnnmtenAmplitudenschwelle 
eingesch lessen ist. 

18. Verfahren nach Anspruch 17, wobei die bestinnmte Amplitudenschwelle 10 dB betragt. 

1 9. Codierer (1 1 ) zunn Codieren eines wenigstens Zweikanal-Audiosignals (L, R), wobei der Codierer Folgendes unnfasst: 

- Mittel (110) zum Bestimmen genneinsamer Frequenzen (fcom) '"^ wenigstens zwei Kanalen (L,R) des 
Audiosignals, wobei die gemeinsamen Frequenzen in wenigstens zwel der wenigstens zwel Kanale des Au- 
dlosignals auftreten, und 

- Mittel (111) zunn Darstellen betreffender sinusformiger Anteile in den betreffenden Kanalen mit einer bestinnm- 
ten gemeinsannen Frequenz durch eine Darstellung der bestinnmte n gemeinsamen Frequenz (fcom) ^ind durch 
eine Darstellung der betreffenden Amplltuden (A, AA) der betreffenden sinusfonnlgen Anteile mIt der bestimmten 
gemeinsamen Frequenz. 

20. Anordnung (1) zum Ubertragen oder Aufzelchnen, wobel die Anordnung Folgendes umfasst: 

- eine Eingangseinheit (10) zum Empfangen eines wenigstens Zweikanal (L,R)-Audiosignals (S), 

- einen Codierer (11) nach Anspruch 19 zum Codieren des Audiosignals (S) zum Erhalten eines codlerten 
Audiosignals ([S]), und 

- eine Ausgangseinhelt zum Llefern des codlerten Audiosignals ([S]). 

21. Codiertes Audiosignal ([S]), das ein wenigstens Zweikanal-Audioslgnal (L,R) darstellt, wobel das codlerte Audiosl- 
gnal Folgendes umfasst: 

- Darstellungen von gemeinsamen Frequenzen (fcom)' wobei diese gemeinsamen Frequenzen Frequenzen 
darstellen, die in wenigstens zwei der wenigstens zwei Kanale des Audiosignals (S) auftreten, und 

- fur eine bestimmte gemelnsame Frequenz (fcom) s'l^® Darstellung der betreffenden Amplltuden (A,AA), die 
betreffende sinusformlge Anteile In den betreffenden Kanalen mit der bestimmten gemeinsamen Frequenz 
darstellen. 

22. Speichermedium (2), auf dem ein Signal gespeichert ist, wie in Anspruch 21 beansprucht. 

23. Verfahren zum Decodleren (31) eines codlerten Audiosignals ([S]), wobel das Verfahren die nachfolgenden Ver- 
fahrensschritte umfasst: 

- das Empfangen (31) des codlerten Audiosignals ([S]), das ein wenigstens Zweikanal-Audioslgnal ((L,R) dar- 
stellt, wobel das codierte Audiosignal Darstellungen von gemeinsamen Frequenzen (fcom) aufwelst, wobei diese 

gemeinsamen Frequenzen Frequenzen darstellen, die in wenigstens zwei der wenigstens zwei Kanale des 
Audiosignals (S) auftreten, und fur eine bestimmte gemeinsame Frequenz (fcom)' ^^^^ Darstellung der betref- 
fenden Amplituden (A,AA), die betreffende sinusformlge Komponenten In betreffenden Kanalen mIt der be- 
stimmten gemeinsamen Frequenz darstellen, und 

- das Erzeugen (31) der gemeinsamen Frequenzen mit den betreffenden Amplltuden In den wenigstens zwei 
Kanalen (L,R) zum Erhalten eines decodierten Audiosignals (S'). 

24. Decoder (31) zum Decodleren eines codlerten Audiosignals ([S]), wobel der Decoder Folgendes umfasst: 

- Mittel (31) zum Empfangen des codlerten Audiosignals ([S]), das ein wenigstens Zweikanal-Audioslgnal (L,R) 
aufwelst, wobei das codierte Audiosignal Darstellungen gemeinsamer Frequenzen (fcom) aufweist, wobei diese 
gemeinsamen Frequenzen Frequenzen darstellen, die in wenigstens zwei der wenigstens zwei Kanale des 
Audiosignals (S) auftreten, und fur eine bestimmte gemeinsame Frequenz (fcom)- sine Darstellung betreffender 
Amplituden (A, AA), die betreffende sinusformlge Anteile in den betreffenden Kanalen mIt der bestimmten 
gemeinsamen Frequenz darstellen, und 
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- Mittel (31) zum Erzeugen der gemeinsamen Frequenzen mit den betreffenden Amplituden in den wenigstens 
zwei Kanalen (L,R) zum Erhalten eines decodierten Audiosignals (S'). 

25. Empfanger Oder Wiedergabeanordnung (3), wobei das Gerat Folgendes umfasst: 

- eine Eingangseinheit (30) zum Empfangen eines codierten Audiosignals ([S]), 

- einen Decoder (31) nach Anspruch 24 zum Decodieren des codierten Audiosignals ([S]) zum Eriiatten eines 

decodierten Audiosignals (S'), und 

- eine Ausgangseinheit (32) zum Liefern des decodierten Audiosignals (S'). 
Revendications 

1. Precede decodage (11) d'un signal audio au moins a deux voles (L, R), le precede comprenant les etapes suivantes 
15 consistant a: 

determiner (1 1 0) des frequences communes (fcom) ^^^^ '^^ au moins deuxvoies (L, R) du signal audio, lesquelles 
frequences communes se presentent dans au moins deux des au moins deuxvoies du signal audio, et 
representer (111) des composantes sinusoidales respectives dans des voies respectives a une frequence 
20 commune donnee par une representation de la frequence commune donnee (fcom) et par une representation 

des amplitudes respectives (A,AA) des composantes sinusoidales respectives a laf requence commune donnee. 

2. Precede de codage selon la revendication 1 , dans lequel la representation des amplitudes respectives (A,AA) 
comprend une amplitude moyenne (A) et une amplitude de difference (AA). 

25 

3. Precede de codage selon la revendication 1, dans lequel la representation des amplitudes respectives (A,AA) 
comprend une amplitude maximale (A) et une amplitude de difference (AA). 

4. Precede de codage selon la revendication 1 , dans lequel des frequences nen communes sent codees en tant que 
30 des frequences communes ou la representation de Tamplitude comprend une indication pour indiquer la au moins 

une vole dans laquelle la frequence ne se presente pas. 

5. Precede de codage selon la revendication 1, dans lequel, outre les frequences communes, des frequences non 
communes sent codees independamment. 

35 

6. Precede selon la revendication 5, dans lequel les frequences non communes sent groupees dans le flux audio code 
dans un bloc separe. 

7. Precede selon la revendication 6, dans lequel les frequences communes sent groupees et sent incorporees dans 
40 le signal audio code precedant le bloc de frequences non communes. 

8. Precede selon la revendication 6, dans lequel les parametres des composantes sinusoidales aux frequences com- 
munes sent incerpores dans une ceuche de base et les parametres des sinusoTdes a des frequences non communes 
dans une ceuche d'enrichissement. 

45 

9. Precede selon la revendication 1 , dans lequel le precede comprend retapeconsistantacombinerdes representations 
de puissance ou d'energie respectives des au moins deuxvoies pour obtenir une representation commune et dans 
lequel I'etape consistant a detenniner les frequences communes est executee sur la base de la representation 
commune. 

50 

10. Precede selon la revendication 9, dans lequel I'etape de combinaison comprend I'addition des spectres de puis- 
sances des au moins deux voies et dans lequel la representation commune est un spectre de puissances commun. 

11. Precede selon la revendication 1, dans lequel des parametres de frequence et d'amplitude sent incorpores dans 
55 une ceuche de base et I'amplitude delta est incorporee dans une couche d'enrichissement. 

12. Precede selon la revendication 1, dans lequel des phases respectives des sinusoTdes respectives a la frequence 
commune donnee sont determinees et dans lequel une representation des phases respectives est incorporee dans 
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le signal audio code. 

13. Precede selon la revendication 12, dans lequel la representation des phases respectives comprend une phase 
moyenne et une phase de difference. 

5 

14. Proc6d6 selon ia revendication 12, dans lequel la representation des phases respectives comprend une phase de 
la voie avec une amplitude la plus elevee et une phase de difference. 

15. Precede selon la revendication 12, dans lequel la representation des phases respectives estseulement incorporee 
^0 dans le signal pour des sinusoides ayant une frequence jusqu'a une frequence de seuil donnee. 

16. Precede selon la revendication 15, dans lequel la frequence de seuil donnee est de I'ordre de 2 kHz. 

17. Precede selon la revendication 12, dans lequel la representation des phases respectives estseulement incorporee 
15 dans le signal pour des sinusoTdes ayant une difference d'amplitude avec au moins une des autres voies jusqu'a 

un seuil d'amplitude donne. 

18. Procede selon la revendication 17, dans lequel le seuil d'amplitude donne est egal a 1 0 dB. 

20 19, Codeur (11) pour coder un signal audio a au moins deux voies (L, R), le codeur comprenant: 

des moyens (110) pour determiner des frequences communes (fcom) ^^^^ 'es au moins deux voies (L, R) du 
signal audio, lesquelles frequences communes se presentent dans au moins deux des au moins deux voies 
du signal audio, et 

25 des moyens (111) pour representer des composantes sinusoTdales respectives dans des voies respectives a 

une frequence commune donnee par une representation de la frequence commune donnee (fcom) ®^ une re- 
presentation des amplitudes respectives (A,AA) des composantes sinusoTdales respectives a la frequence 
commune donnee. 

30 20. Appareil (1) de transmission ou d'enregistrement, I'appareil comprenant 

une unite d'entree (10) pour recevoir un signal audio (S) a au moins deux voles (L, R), 

un codeur (11) selon la revendication 19 pour coder le signal audio (S) pour obtenir un signal audio code ([S]), et 
une unite de sortie pourfoumir le signal audio code ([S]). 

35 21 . Signal audio code ([S]) representant un signal audio a au moins deux voies (L, R), le signal audio code comprenant: 

des representations des frequences communes (fcom)' lesquelles frequences communes representent des 
frequences qui se presentent dans au moins deux des au moins deux voies du signal audio [S], et 
pour une frequence commune donnee (fcom)> ^^i^^ representation des amplitudes respectives (A,AA) represen- 
40 tant des composantes sinusoTdales respectives dans des voies respectives a la frequence commune donnee. 

22. Support de memoire (2) ayant memorise surcelui-ci un signal selon la revendication 21. 

23. Proceder de decodage (31) d'un signal audio code ([S]), le procede comprenant: 

45 

la reception (31) du signal audio code ([8]) representant un signal audio a au moins deux voles (L, R), le signal 
audio code comprenant des representations des frequences communes (fcom)' lesquelles frequences commu- 
nes representent des frequences qui se presentent dans au moins deux des au moins deux voies du signal 
audio [S], et pour une frequence commune donnee (fcom)' une representation des amplitudes respectives (A,AA) 
50 representant des composantes sinusoTdales respectives dans des veies respectives a la frequence commune 

donnee, et 

la generation (31) des frequences communes aux amplitudes respectives dans les au moins deux voies (L, R) 
pour obtenir un signal audio decode (S'). 

55 24. Decodeur (31) pour decoder un signal audio code ([S]), le decodeur comprenant: 

des moyens (31) pour recevoir le signal audio code ([S]) representant un signal audio a au moins deux voies 
(L, R), le signal audio code comprenant des representations des frequences communes (fcom)' lesquelles 
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frequences communes representent des frequences qui se presentent dans au moins deux des au moins deux 
voies du signal audio [S], at pour une frequence commune donnee (fcom)' une representation des amplitudes 
respectives (A,AA) representant des composantes sinusoTdales respectives dans des voies respectlves a la 
frequence commune donnee, et 

des moyens (31) pour generer les frequences communes aux amplitudes respectives dans les au moins deux 
voies (L, R) pour obtenir un signal audio d6cod6 (S'). 

25. Recepteur ou appareil de reproduction (3), I'appareil comprenant: 

une unite d'entree (30) pour recevoir un signal audio code ([S]), 

un decodeur (31)selon la revendication 24 pour decoder le signal audio code ([S]) pour obtenir un signal audio 
decode (S'), et 

une unite de sortie (32) pourfournir le signal audio decode (S'). 
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